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SYSTEM AND METHOD FOR A NEGATIVE ACKNOWLEDGMENT-BASED 
TRANSMISSION CONTROL PROTOCOL 

FIELD OF THE INVENTION 

5 The present invention relates to digital communication methods and systems. In 

particular, the present invention relates to a system and method for providing a transmission control 
protocol (TCP) based on negative acknowledgment. 

BACKGROUND OF THE INVENTION 

10, The Internet continues to grow and to extend to new geographic locations and new 

=D network topologies, using a variety of physical media. The emergence of new network topologies 
g and development of new physical media are parallelled by a continuous quest to improve the 
performance of the Internet. Developed by the U.S. Department of Defense, TCP is the most 
y common transport layer protocol used on the Internet. TCP is built on top of Internet Protocol (IP) 

if and is nearly always seen in the combination TCP/IP (TCP over IP). The general ability of the 
y Internet to scale to new challenges is a testimony to the robustness and solid design behind the 
jij TCP/IP suite of protocols. 

'!g IP, upon which the Internet is based, is the protocol by which data is packaged and 

! ~ sent over the Internet. IP forwards each packet of information based on a 4-byte destination address, 
20 or IP number. Packets are sent to gateway machines, which route them according to their address. 
TCP verifies the transmission of the data. TCP adds reliable communication, flow-control, and 
connection-oriented communication, and provides full-duplex, process-to-process connections. TCP 
is capable of detecting errors and lost data and of signalling re-transmission of flawed packets, and 
provides self-clocking, elastic use of available bandwidth, cooperative congestion avoidance, and 
25 reliable transmission of datagrams. 

By contrast, User Datagram Protocol (UDP) provides an Internet standard network 
layer, transport layer and session layer protocols which provide simple but unreliable datagram 
services. UDP is a connectionless protocol which, like TCP, is layered on top of IP. UDP neither 
guarantees delivery nor does it require a connection. As a result it is lightweight and efficient, but 
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all error processing and re-transmission must be taken care of by the application program. 

TCP Tahoe is the earliest implementation of TCP, and is often referred to as a 
benchmark for performance comparison. Several variants of TCP have emerged including TCP 
Reno, TCP new-Reno, TCP SACK (Selective Acknowledgment), TCP FACK (Forward 
5 Acknowledgment) and TCP Vegas. 

The primary objective of these variants is to improve the performance for an 
individual TCP flow. This objective is achieved by modifying the flow control and congestion 
avoidance algorithms of TCP to maximize end-to-end TCP performance, while ensuring that the 
end-hosts behave like good network citizens who try to utilize unused network bandwidth, but also 
10. back off when they detect network congestion. 

i fl Generally, these prior art TCP schemes use cumulative acknowledgments, also known 

J Ej as ACKs, for error recovery and congestion control. In a simplified example, sequential data packets 
^ are sent from a transmitting host to a receiving host. As each packet is received at the receiving host, 
\d the receiving host sends an acknowledgment back to the transmitting host. If an error, such as a 
15"* dropped packet, has occurred, the transmitting host will not receive an acknowledgment for that 
1 packet. In such a situation, the transmitting host then re-transmits the dropped packet. Where a 
IU number of acknowledgments are not received, TCP assumes that the network is congested, and 
.fi decreases the transmission rate, or congestion window. Recent TCP implementations permit a 
s « receiver to send an acknowledgment that covers multiple received packets. 
20 Recent studies of Internet traffic have shown that a significant percentage (up to 40%) 

of packets on the Internet contain zero bytes of data, or, in other words, contain only control 
information. Since up to 95% of network packets are TCP, it can be assumed that these control 
packets are mainly TCP acknowledgments and other TCP control packets, such as synchronization 
messages (SYN), and finish transmit messages (FIN). If we conservatively assume that two third 
25 of the control packets are acknowledgments, then approximately 26% of all network traffic is 
acknowledgments. This is a significant proportion of the packets on the network. 

While acknowledgments constitute a large percentage of the packets on the Internet, 
it is possible to argue that they do not consume a large percentage of the bandwidth and, therefore, 
do not pose much of a performance concern. However, although the acknowledgments do not utilize 



a significant part of the bandwidth on the network, they can be problematic for a number of reasons. 
Acknowledgments consume significant resources in terms of load network devices, such as routers. 
While the network devices have less processing to do on the small acknowledgment packets, there 
is still a base set of resources consumed by each packet that passes through such devices, regardless 
of packet size. The end result of this is that the router can handle less data packets due to the load 
of control traffic. 

Acknowledgments are susceptible to a phenomenon referred to as acknowledgment 
compression. This causes a bursty TCP sending pattern, and contributes to congestion and degraded 
performance for TCP flows. This is particularly true for asymmetric links, such as wireless, 
Asynchronous Digital Subscriber Line (ADSL) and satellite, where acknowledgments are 
transmitted at lower bandwidth rates than the data sending rates. 

Acknowledgments also pose a problem in the emerging Differentiated Services IP 
Quality of Service (QoS) networks. In such a network, packets are explicitly marked for preferential 
treatment. The presence of an acknowledgment-based scheme implies that priority-marking must 
be performed by edge devices at both the sender and the receiver. This implies the need for double- 
ended Service Level Agreements (SLA), which are not viable on a large scale Internet. Further, it 
raises issues of scalability and an increased level of control traffic. 

Finally, acknowledgments are a significant overhead for the large emerging market 
of low-power wireless devices such as personal digital assistants (PDA). Acknowledgment control 
traffic use valuable and limited resources on such devices. 

It is therefore desirable to provide a system and method to reduce acknowledgment 
traffic generated by TCP. It is further desirable to provide a system and method for error recovery 
and congestion control that does not require acknowledgments. 

SUMMARY OF THE INVENTION 

In a first aspect, the present invention provides a data communications system 
employing transmission control protocol for providing error recovery and congestion control on a 
data communications network. The system includes a transmitter for sending a sequence of data 
packets, and a receiver for receiving the sequence of data packets. The transmitter has a round-trip 



timer that is set upon sending each data packet. The receiver can return a negative acknowledgment 
to the transmitter to cause re-transmission of a missing data packet. A congestion window for the 
network can be adjusted in response to receipt of the negative acknowledgment, or expiry of the 
round-trip timer. The system can further include a missing-packet timer at the receiver that is set 
when a negative acknowledgment is sent, and a re-transmission time-out timer at the transmitter for 
further controlling the congestion window. 

According to a further aspect of the present invention, there is provided a method for 
transmitting data in a data communications network. The method begins by establishing a 
communications link between the transmitter and the receiver. Setting the communications link 
includes setting the congestion window to an initial length. A sequence of data packets is sent from 
the transmitter to the receiver. The receiver detects any missing packets, by examining the sequence 
numbers of the incoming packets, and sends negative acknowledgments, generally no more than 
four, to the transmitter identifying the missing data packet. When the transmitter receives a negative 
acknowledgment, it decreases the length of the congestion window, and re-transmits the missing 
packet. 

In a presently preferred embodiment, the congestion window is halved in response 
to receipt of three duplicate negative acknowledgments. The round-trip timer can also be set upon 
transmitting the data packet, and the congestion window increased, typically doubled, if the round- 
trip timer expires without receipt of a negative acknowledgment. A keep-alive request can also be 
periodically sent from the transmitter to the receiver, and a re-transmission time-out timer set. If an 
acknowledgment to the keep-alive request is not received before expiry of the re-transmission time- 
out timer, the transmitter backs off for a predetermined period. 

In a further aspect, there is provided a method for error recovery in a data 
communications network where data is transmitted as a stream of data packets sent from a 
transmitter to a receiver. After a missing packet is detected at the receiver, negative 
acknowledgments, preferably not more than four for any one missing packet, are sent to the 
transmitter. The missing-packet timer is set at this time, and where the missing packet is not received 
at the receiver in response to the negative acknowledgment before expiry of the missing-packet 
timer, a further negative acknowledgment is sent. The missing-packet timer is cleared upon receipt 



of the missing packet at the receiver. 

In a further aspect of the present invention, there is provided a method for congestion 
control in a data communications network where data is transmitted as a sequence of data packets 
from a transmitter to a receiver. The congestion window is set to an initial size, a data packet is sent 
from the transmitter, and the round-trip timer is set. If no negative acknowledgment is received 
before expiry of the round-trip timer the congestion window is increased. If a negative 
acknowledgment is received, the length of the congestion window is decreased. The round-trip time 
can be estimated empirically by sending a round-trip time update request to the receiver. 

In one embodiment, the congestion window is doubled, and the interval between 
transmission of subsequent data packets is decreased, upon expiry of the round-trip timer. In 
another, the congestion window is multiplicatively increased. Furthermore, a keep-alive request can 
be sent from the transmitter to the receiver, and the re-transmission time-out timer to detect a re- 
transmission time-out. If no acknowledgment of the keep-alive request is received at the transmitter, 
the congestion window is re-set to one for a back-off period. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiments of the present invention will now be described, by way of 
example only, with reference to the attached Figures, wherein: 

Figure 1 is a block diagram of a network communications system; 
Figure 2 is a schematic of a typical TCP packet; 

Figure 3 shows transmission of a sequence of TCP packets according to the prior art; 
Figure 4 is shows transmission of a sequence of TCP packets according to the present 

invention; 

Figure 5 is a flow chart of an embodiment of the method of the present invention; 
Figure 6 is a flow chart of the receiver side of the method of Fig. 5; 
Figure 7 is a flow chart of the transmitter side of the method of Fig. 5; and 
Figure 8 is a flow chart of a re-transmission time-out according to the present 

invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

A typical network communication system, generally designated at 10, is shown in 
Fig. 1 . In the example depicted, system 10 consists of a first host A and a second host B connected 
through a network 12, such as the Internet. Hosts A and B are generally located at different 
5 locations, and are running separate application programs, referred to herein as source and destination 
process 14, 16. Source and destination processes 14, 16 are associated with source and destination 
transmission control protocol (TCP) layers which permit the transmission of digital data, and which 
are referred to in this description as transmitter 15 and receiver 17, respectively. As is well known, 
system 10 will typically have multiple hosts connected to the Internet 12 at any one time. To 
l(L communicate and share information, hosts A and B pass messages, in the form of a digital byte 
0 stream, through network 12. To ensure that the messages are correctly and efficiently routed from 
;5 host A to host B, and their applicable processes 14, 16, the messages generally are routed through 
;fj one or more routers 18. A congestion window determines the transmission rate over network 12, 
Uj and is periodically updated to prevent congestion collapse, or to take advantage of increased network 
\f bandwidth. 

^ In order to provide reliable, connection-oriented service, transmitter 15 breaks an 

i!j incoming application byte stream into a sequence of packets 20. Packets are also referred to in the 
! fj art as segments. A typical packet 20 is shown in Fig. 2, and consists of a 20-byte header 22 and, 
;y optionally, some data 24. As used herein, a data packet is a packet that contains data, and a control 
20 packet is a packet that contains only header/control information. Header 22 contains source and 
destination port numbers 26, 28 that identify the source process 14 from which packet 20 is sent, and 
the destination process 16 for which it is intended. Header 22 also includes a sequence number 30 
that identifies the last byte of data sent, an acknowledgment number 32 that is to be returned to the 
source process 14, and other control information 34. 
25 Port numbers 26, 28 are the addresses of transmitter 1 5 and receiver 1 7, at the source 

and destination, respectively. Two processes can communicate by agreeing on the port numbers they 
will use for communications. As used herein, a process is merely the application program currently 
in execution at source or destination, including the instantaneous value of all its variables in memory 
and registers, and the current value of its program counter. 



A 3-way handshake is used to initiate and terminate TCP connections. The 3-way 
handshake is necessary because TCP is layered on an unreliable datagram service provided by the 
internet protocol (IP), such that control packets can be lost, duplicated or delivered out of order. 
This can lead to trouble if original or re-transmitted packets arrive while the connection is being 
established, and the handshake is used to overcome this eventuality. A TCP connection is closed 
using a well known 4-way handshake sequence where FIN control packets are sent and received. 

Fig. 3 shows the transmission, over time, of a sequence of data packets in a typical 
prior art system from a transmitting host A to a receiving host B, each executing respective processes 
14, 16. With reference to Figs. 2 and 3, a sequence of packets numbered 1 to 5 from host A, and 
timers at the transmitter are set for each packet. When, for example, packet 1 is received correct and 
intact at B, a control packet, containing an acknowledgment (indicated by ACK 1), including the 
byte count of the last byte correctly received, is returned to A. When the acknowledgment is 
received at A, transmitter 1 5 at A recognizes that packet 1 has been received, and its respective timer 
is cleared. 

Fig. 3 also shows the sequence of events in the prior art when the network fails to 
deliver a data packet. In this example, packet 4 is dropped by network 1 2 . Since A does not receive 
an acknowledgment control packet (ie. no ACK 4), the timer, which was set when packet 4 was sent, 
times out. When the timer times out, transmitter 15 assumes that packet 4 was lost, and re-sends it 
to B, as indicated by the second arrow labelled 4. Transmitter 15 can potentially deliver duplicated 
packets, and can deliver packets out of order. Generally, TCP buffers or discards out of order or 
duplicated packets appropriately, using the byte count for identification. 

The present invention reduces the acknowledgment traffic of the prior art by 
replacing the prior art acknowledgments, for received data packets, with negative acknowledgments. 
Instead of returning an acknowledgment for each successfully received data packet, a negative 
acknowledgment is returned for packets that are not received. Referring to Fig. 1 , transmitter 1 5 of 
the present invention includes a return trip timer 36 and a re-transmission time-out timer 38, while 
receiver 17 includes a missing-packet timer 40. The basic operation of the system and method of the 
present invention is shown in Fig. 4. As in the previous prior art example shown in Fig. 3, a 
sequence of packets numbered 1 to 5 is sent from host A. However when packets 1 through 3 are 
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received correct and intact at B no acknowledgment is returned to A. Instead, transmitter 1 5, at host 
A, continues to send the sequence of packets at a predetermined rate. Packet 4 is again dropped by 
network 12. When packet 5 is received, receiver 17 recognizes that a packet is missing, set missing- 
packet timer 40, and sends a negative acknowledgment for packet 4 (ie. NACK 4) to transmitter 15. 
5 In this simplified example, transmitter 15 receives the negative acknowledgment and re-transmits 
missing packet 4. When packet 4 is finally received at B, the respective missing-packet timer 40 is 
cleared. 

System 10 generally operates as described in the flow chart of Fig. 5. Fig. 5 includes 
steps 50, 52, 54, 56, 58 and 60 described below. Initially, a communications link is established 
1A* between source process 14 and destination process 16. In a preferred embodiment, the 
; 0 communications link is established by a conventional 3 -way TCP handshake between transmitter 
□ 15 and receiver 17. At the outset, the transmission rate, or congestion window, of the link is set to 
11 an initial length. Transmitter 1 5 then sends a sequence of data packets 20 to receiver 16. The order 
W of the packets 20 is determined by their respective sequence numbers 30. Receiver 1 7 monitors the 
l§ sequence numbers 30 of the incoming packets, and detects any missing, or out of sequence, packets. 
[% If receiver 17 detects that a data packet is missing, a negative acknowledgment is sent back, or 
returned, to transmitter 15. The negative acknowledgment is contained within a control packet or 
=fl piggybacked on a data packet, and identifies the missing data packet. In response to the success rate 
k " of packet transmission, transmitter 15 adjusts the size of the congestion. For example, the receipt 
20 of a negative acknowledgment signals to transmitter 15 that the packet may have been lost due to 
network congestion. Therefore, the length of the congestion window, or transmission rate over the 
network, is reduced. Conversely, if no negative acknowledgments are received at transmitter 15 in 
a particular time period, it assumes that the transmission rate can be increased, and adjusts the 
congestion window appropriately. The adjustment of the congestion window is further described 
25 below. Finally, in response to a negative acknowledgment is re-transmitted by transmitter 15. 

The operation of system 10 at the receiver end is illustrated in the flow chart of Fig. 
6, at steps 62, 64, 66, 68, 70, 72, 74, 76 and 78. Initially, a conventional opening 3-way handshake 
is performed. This retains the states used in prior art implementations of TCP. Thus, it is possible 
to extend operation of the present method to networks with deployed hosts using a variety of TCP 



-9- 

variants. Once the handshake has been completed and transmitter 1 5 begins to send a sequence of 
packets 20 to receiver 1 7, receiver 1 7 monitors the sequence numbers of incoming packets. Receiver 
17 does not respond to any received packets unless it senses a gap between the sequence numbers 
30 of packets 20 that have been received. Gaps in sequence numbers 30 are sensed on arrival of out- 

5 of-order packets, or dropped packets. On detection of a gap, receiver 1 7 sets missing-packet timer 
40 3 and returns a negative acknowledgment to the sender. Receiver 17 then continues to send 
negative acknowledgments upon receipt of successive packets until it has either successfully 
received the missing packet, or, in a presently preferred embodiment, it has sent a predetermined 
number of negative acknowledgments, such as four, for the missing packet. The negative 
l(L acknowledgments are again contained within a control packet or piggybacked on a data packet, and 

; J3 identify the missing data packet. If the missing packet successfully arrives at receiver 17, timer 40 

□ is cleared. 

jfj; For ease of comprehension of the drawing, Fig. 6 does not show that receiver 17 

W continues to monitor the sequence numbers of incoming packets to detect further gaps after it has 
15 returned the negative acknowledgment and before missing-packet timer 40 is cleared, or before 
;tf sending the final acknowledgment at step 78. However, as will be clear to those of skill in the art, 
!U after step 70, receiver 1 7 continues to monitor the incoming sequence of data packets as they arrive 
; p to detect further missing packets, and repeats the process in parallel from step 66. 
y Where network 1 2 is congested, resulting in dropped packets in both directions, it is 

20 possible that the negative acknowledgments sent by receiver 1 7 will also be dropped. In a presently 
preferred embodiment, if four negative acknowledgments are sent to transmitter 15 for a single 
packet, receiver 17 waits until missing-packet timer 40 expires, at which time it returns a final 
negative acknowledgment to transmitter 15, as shown at steps 76 and 80. 

The initial duration of missing-packet timer 40 can either be an estimate of the round- 
25 trip time or some fixed delay value, such as one second. Picking a large fixed delay as the initial 
timer value instead of a round-trip estimation is easier and less complex to implement. The setting 
for missing-packet timer 40 can be adjusted according to changes in the estimated or measured 
round-trip time as the transmission progresses. 

The general operation of system 10 at the transmitter end is shown in the flow chart 
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of Fig. 7, at steps 90, 92, 94, 96, 98, 100, 102, and 104. A communications link between transmitter 
15 and receiver 17 is initiated with a conventional 3 -way handshake, as described above. 
Transmitter 15 then determines the round-trip time over network 12. The manner in which these 
times can be determined is described in greater detail below. A sequence of packets 20 is then sent 
5 to receiver 1 5 , and a round-trip timer 3 6 and re-transmission time-out timer 3 8, is set for each packet. 
When the round-trip timer 36 for each packet expires, the size of the congestion window is adjusted. 
Generally, in response to receipt of negative acknowledgments at transmitter 15, the transmission 
rate is decreased. Where no negative acknowledgments are received in a predetermined round-trip 
time, the transmission rate is increased. If no negative acknowledgments are received during the 
1 : CL round-trip time, transmitter 15 assumes packet transmission was successful and increases the 
5 o congestion window. The transmitter 1 5 then returns to step 92, where the round-trip time is updated, 
; Ej as necessary, and continues to inject packets into network 12. This process continues until there are 
!i! no more data packets to transmit, or until a re-transmission time-out occurs, as will be described 
ijj below. 

if As shown at steps 102 and 104, the present invention adjusts the size of the 

y congestion window, or transmission rate, whenever a negative acknowledgment is received at 
ill transmitter 1 5 . The adjustment ensures that packets 20 are injected into network 1 2 in a controlled 
: S fashion to avoid congestion collapse. Multiplicative increase during slow-start, and additive increase 
sy and multiplicative decrease during congestion avoidance, can be used, in conjunction with an 
20 inferred round-trip time to set the rate of transmission and to permit transmitter 15 to elastically 
utilize free network bandwidth and back off in the presence of sensed network congestion. In the 
example shown in Fig. 7, adjustment of the congestion window occurs after each packet is sent and 
round-trip timer 36 expires. In reality, receipt of a negative acknowledgment at any time at 
transmitter 15 will result in adjustment of the congestion window length, as shown at steps 100 to 
25 1 04. 

When the connection is first established, as in other implementations of TCP, 
transmitter 15 starts in a slow-start mode, and a single packet is sent to receiver 17. Alternatively, 
an increased initial window size, as is proposed for TCP Reno, can be used. Again, if no negative 
acknowledgments are received during the round-trip time, transmitter 15 assumes packet 
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transmission was successful and increases the congestion window. Typically, in slow start, the 
congestion window is doubled in size. 

Generally, the inventors of the present invention contemplate that the congestion 
window can be increased in one of three ways after start up. The choice of the congestion window 
increase method to be used depends on the level of accuracy desired for congestion control. In the 
simplest embodiment, if no negative acknowledgment is received on expiry of the round-trip timer 
36 set for a packet, the congestion window is doubled, and all queued packets are sent in a 
continuous sequence. If no negative acknowledgments are received for these packets, the congestion 
window is again doubled at the expiry of each round-trip timer 38, and the process repeats. 

In another embodiment, the congestion window is doubled on the expiry of each 
round- trip timer 36. However, rather than transmitting the queued packets in a burst, they are 
transmitted with some minimal interval between each packet. The interval time between packets can 
be chosen as desired. For example, the interval can be chosen such that transmission of all packets 
is completed during one round-trip time. 

In a further embodiment multiplicative increase is achieved. A single packet is 
transmitted and round-trip timer 36 is set, as described above. If no negative acknowledgment is 
received for the first packet, then second and third packets are transmitted in staggered fashion at an 
interval calculated according to: 

Interval = (RTT/cwin)*(l/a) 
where RTT is the round-trip time; cwin is the length of the congestion window; and a is the number 
of packets being sent in a staggered fashion. A round-trip timer 36 is set for each of the second and 
third packets when they are transmitted. If no negative acknowledgment is received on expiry of 
timer 36 set for the second packet, the size of the congestion window is doubled, and four more 
packets are sent in a staggered manner, and respective timers 36 are set. On expiry of the timer 36 
set for the third packet where no negative acknowledgment is returned, the congestion window is 
again increased by a factor of two. This results in a congestion window four times the original 
length. Eight packets are then sent in a staggered fashion according to the above equation, and a 
round-trip timer 36 is set for each one. The congestion window continues to be doubled if no 
negative acknowledgments are received at the expiry of each timer, and the number of packets sent 
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is doubled accordingly. 

In a presently preferred embodiment, regardless of the method used to increase the 
congestion window, on receipt of three duplicate negative acknowledgments, the length of the 
congestion window is halved, and set as a threshold value. 

Accurately inferring the round-trip time can be important since it can play a key role 
in adjusting the transmission rate, as described above. As is known to those of skill in the art, in the 
prior art round-trip time is generally calculated by a method developed by Van Jacobson. Van 
Jacobson' s method uses receipt time of acknowledgments for each packet to measure the round-trip 
time per packet, and then calculates a smoothed running round-trip time estimator. The re- 
transmission time-out can then be calculated based on the smoothed round-trip time estimator, plus 
four times a mean deviation estimator. The calculation of the re-transmission time-out is not 
performed for every packet transmitted, but, in accordance with Karn's algorithm, only when a 
packet is transmitted for the first time. 

In the present invention, the initial round-trip time can be estimated, as in the prior 
art, during the synchronization phase of the 3-way handshake. However, since the round-trip time 
can vary over the life of a connection, it is preferable that it be updated during the network session 
on an ongoing basis. In a presently preferred embodiment, the round-trip time is updated by 
periodically setting a TCP option on an outgoing data packet requesting a round-trip time update. 
On detecting this round-trip time update request, receiver 15 responds with a forced 
acknowledgment, a round-trip time update acknowledgment, the purpose of which is to measure the 
current round-trip time. This permits the use of Van Jacobson's method for calculating the round-trip 
time. 

As will be appreciated by those of skill in the art, the window size of receiver 1 7 must 
also be periodically provided to transmitter 15. In the event that data packets are sent from receiver 
17 to transmitter 15, this information can be piggy-backed on such packets. In the absence of data 
packets from receiver 17, receiver 17 can periodically generate window update acknowledgments 
to transmitter 1 5 . Two distinct window update acknowledgments can be generated. The first can be 
a timed window update packet which is generated by receiver 1 7 at regular intervals. The second can 
be a forced window update packet which is generated by receiver 17 whenever the receive window 
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drops below a certain threshold. With the cheap price of memory, the receiver windows on end-hosts 
can be large. Therefore, it is currently contemplated that sending forced window update 
acknowledgments should be sufficient. 

Generally, re-transmission time-outs are required because individual negative 
acknowledgments may not reach transmitter 15. Negative acknowledgments like all other packets 
are susceptible to loss in routers 19, route flapping and physical connection failure. On a re- 
transmission time-out, transmitter 1 5 backs off for a predetermined period, ie. it goes into slow-start 
and sets its congestion window to one. Transmitter 1 5 then operates in multiplicative increase mode 
until it hits a threshold value, at which point it reverts to a linear increase for the congestion window. 
The threshold value can either be threshold value set when transmitter 15 receives three duplicate 
negative acknowledgments, as described above. Or, if such a threshold value has not been previously 
set, the threshold value is set, on detection of a re-transmission time-out, as half the value of the 
current congestion window length. An exponential back-off for re-transmission time-out can also 
be implemented, as in the prior art. In the presence of successive re-transmission time-outs, the back- 
off duration is doubled for each re-transmission time-out. The re-transmission time-out can be 
calculated in the same manner as in TCP Reno, or as described below. 

Since, in the system of the present invention, negative acknowledgments are only sent 
when receiver 17 detects problems, no negative acknowledgment over a long period of time can 
either indicate a serious problem because the negative acknowledgments are not getting through or 
can indicate that the network is lightly loaded and has not yet encountered a problem serious enough 
to warrant a negative acknowledgment. Since transmitter 1 5 cannot can not rely on the presence or 
absence of negative acknowledgments for re-transmission time-out detection, it can periodically 
generate keep-alive requests on a much finer granularity than a keep-alive timer in the prior art, 
which is in the range of one second. Receiver 17 can generate an acknowledgment in response to the 
keep-alive request. For example, as shown in Fig. 8 at steps 1 10, 112, 1 14, 116, and 118, transmitter 
15 periodically transmits a keep-alive request, typically piggy-backed on an outgoing data packet, 
and set re-transmission time-out timer 38. If an acknowledgment for the keep-alive request is 
returned within the re-transmission time-out interval, the re-transmission time-out timer 38 is 
cleared, and the process repeats at predetermined periods. If no acknowledgment is received at 
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transmitter 15 within the re-transmission time-out interval, a re-transmission time-out occurs, and 
transmitter 15 backs off for a predetermined period, preferably according to the exponential back-off 
algorithm. On return from back-off, the process repeats. Generally, no packets are re-transmitted 
unless negative acknowledgments are received from receiver 1 7, and packet transmission continues 
with the next scheduled packet. Packets lost in network 12 will be negatively acknowledged when 
receiver 17 determines which, if any, packets were lost. 

To end transmission, the present invention uses a 4-way closing handshake, as in the 
prior art. While receiver 17 has no way of detecting lost finish messages, transmitter 1 5 can detect 
the lack of an acknowledgment for the finish request, and will resend the finish request. 

In a further embodiment of the present invention, it is contemplated that negative 
acknowledgments can be used to convey information about a missing range of packets as opposed 
to just one packet. However, this necessitates a more complex congestion avoidance scheme. 

The present invention eliminates the large number of acknowledgments generated by 
typical TCP hosts to acknowledge receipt of data packets. However, it introduces several new 
messages, including the negative acknowledgment; forced, periodic or requested window update 
request; keep-alive request, return trip time update request. To reduce the overhead consumed by the 
above control messages, it is contemplated that the additional messages sent by transmitter 15 can 
be piggy-backed onto data packets, and that responses from receiver 17 can be piggy-backed onto 
negative acknowledgments where available. If no negative acknowledgments are available, the 
receiver responses can be combined into a single packet where applicable. It is believed that this 
will sufficiently reduce the amount of out-of-band traffic required to implement the present 
invention. 

Inter-operation between the present invention and heterogeneous hosts running 
different variants of TCP can be achieved by simple modification to the opening 3 -way handshake. 
TCP option fields can be set that permit the transmitter and receiver to negotiate use of an 
appropriately supported version of TCP. 

The main advantage of the present invention is that it introduces fewer control packets 
on the network than prior art implementations of TCP. It is estimated that under the present 
invention the ratio of negative acknowledgment to data packets will be proportional to the network 
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loss rate. A study of the Internet showed sample loss rates of 5% . In such an environment, one 
would expect only 5 - 10% of network packets to be negative acknowledgments as opposed to the 
current scheme where roughly 26% of network packets are acknowledgments. The present invention 
can be particularly beneficial to congested routers, asymmetric links, emerging Diffserv QoS 
networks and low power devices in a wireless environment. 

The present invention retains reliable data transfer and congestion/flow control, which 
are the hallmarks of TCP. The receiver generates negative acknowledgments only for missing 
packets, while providing for additive increase on high bandwidth networks, and multiplicative 
decrease on inference of upcoming congestion. 

A further advantage of the present invention is that, in general, the prior art 
congestion control principles are retained. Only minor modifications are required to permit the 
transmitter to elastically use free network bandwidth, and to back-off in the presence of sensed 
network congestion. Additionally, as other congestion avoidance algorithms are developed, they can 
be integrated with the present invention. 

The above-described embodiments of the invention are intended to be examples of 
the present invention and alterations and modifications may be effected thereto, by those of skill in 
the art, without departing from the scope of the invention which is defined solely by the claims 
appended hereto. 
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We claim: 

1 . A method for transmitting data in a data communications network, comprising the steps of: 

(i) establishing a communications link between a transmitter and a receiver, the 
communications link having a congestion window set to an initial length; 

(ii) transmitting data packets from the transmitter to the receiver; 

(iii) detecting a missing packet at the receiver; 

(iv) sending a negative acknowledgment from the receiver to the transmitter for the 
missing data packet; 

(v) decreasing the length of the congestion window in response to the negative 
acknowledgment; and 

(vi) re-transmitting the missing packet. 

2. A method according to claim 1 , wherein up to four duplicate negative acknowledgments are 
sent from the receiver. 

3. A method according to claim 1, wherein the congestion window is halved at step (v). 

4. A method according to claim 1, further including a step of setting a round- trip timer at the 
transmitter upon transmitting the data packet. 

5. A method according to claim 4, wherein the congestion window is increased upon expiry of 
the return trip timer. 

6. A method according to claim 5, wherein the congestion window is doubled. 

7. A method according to claim 1, wherein a keep-alive request is periodically sent from the 
transmitter to the receiver, whereupon a re-transmission time-out timer is set. 

8. A method according to claim 7, wherein the transmitter determines if an acknowledgment 
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to the keep-alive request is not received before expiry of the re-transmission time-out timer, 
whereupon the transmitter backs off for a predetermined period. 

9. A method according to claim 1 , wherein the congestion window is decreased in response to 
5 three duplicate negative acknowledgments. 

10. A method according to claim 1, wherein the data communications network in an internet. 

11. A method for error recovery in a data communications network where data is transmitted as 
10 a sequence of data packets sent from a transmitter to a receiver, comprising the steps of: 

•o (i) detecting a missing packet at the receiver; 

% (ii) sending a negative acknowledgment from the receiver to the transmitter for the 

^ missing packet; 

ii| (iii) setting a missing-packet timer at the receiver upon sending the negative 

\y acknowledgment; and 

! 3 (iv) where the missing packet is not received at the receiver in response to the negative 

iy acknowledgment before expiry of the missing-packet timer, sending a further negative 
^ acknowledgment. 

20 12. An error recovery method according to claim 1 1, wherein the missing packet is detected 
according to a gap in sequence numbers of the stream of data packets. 

13. An error recovery method according to claim 11, wherein up to four negative 
acknowledgments are sent from the receiver to the transmitter before expiry of the missing-packet 
25 timer. 



14. An error recovery method according to claim 1 1 , wherein the missing-packet timer is cleared 
upon receipt of the missing packet at the receiver. 
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15. A method for congestion control in a data communications network where data is transmitted 
as a sequence of data packets from a transmitter to a receiver, comprising the steps of: 

(i) setting a congestion window to an initial size, the congestion window relating to a 
transmission rate over the network; 

(ii) transmitting a data packet from the transmitter to the receiver; 

(iii) setting a round-trip timer at the transmitter upon sending the packet; 

(iv) increasing the congestion window if no negative acknowledgment for the data packet 
is received before expiry of the round-trip timer; and 

(v) decreasing the length of the congestion window if a negative acknowledgment for the 
data packet is received at the transmitter. 

16. A congestion control method according to claim 15, further including a step of empirically 
determining the round-trip time. 

17. A congestion control method according to claim 1 6, wherein a round- trip time update request 
is sent to the receiver. 

18. A congestion control method according to claim 15, wherein the congestion window is 
doubled, and an interval between transmission of subsequent data packets is decreased, upon expiry 
of the round-trip timer. 

19. A congestion control method according to claim 15, wherein the congestion window is 
multiplicatively increased. 

20. A congestion control method according to claim 15, further including steps of sending a 
keep-alive request from the transmitter to the receiver, and setting a re-transmission time-out timer 
to detect a re-transmission time-out. 



21. A congestion control method according to claim 20, wherein the congestion window is set 
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to one for a back-off period if no acknowledgment is received in response to the keep-alive request, 
before expiry of the re-transmission time-out timer. 



22. A data communications system employing transmission control protocol for providing error 
5 recovery and congestion control on a data communications network, comprising: 

a transmitter for sending a sequence of data packets, the transmitter having a round- trip timer 
that is set upon sending each data packet; 

a receiver for receiving the sequence of data packets, the receiver detecting a missing packet 
in the sequence of data packets, and returning a negative acknowledgment for the missing data 
1Q„ packet to the transmitter to cause re-transmission of the missing data packet; and 
:D means for adjusting a congestion window in response to receipt of the negative 

;EJ acknowledgment, and expiry of the round-trip timer. 

! d 23 . A system according to claim 2 1 , further including a missing-packet timer at the receiver upon 
15" expiry of which a final negative acknowledgment is sent to the transmitter. 

|l| 24. A system according to claim 21, further including a re-transmission time-out timer at the 
transmitter, the means for adjusting responding to expiry of the re-transmission time-out timer. 

20 



25 
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ABSTRACT 

A system and method for transmitting data in a data communications network, using 
a transmission control protocol, to provide reduced acknowledgment control traffic, error recovery 
and congestion control. A communications link is established between a transmitter and a receiver. 
Setting the communications link includes setting a network congestion window to an initial length. 
A sequence, or stream, of data packets is sent from the transmitter to the receiver. The receiver 
detects any missing packets, by examining the sequence numbers of the incoming packets, and sends 
negative acknowledgments, generally no more than four, to the transmitter identifying the missing 
data packet. When the transmitter receives a negative acknowledgment, it decreases the length of 
the congestion window, and re-transmits the missing packet. Detection and use of round-trip time, 
re-transmission time-out are provided. 
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